Abstract. The edit distance between two graphs on the same labeled
vertex set is the size of the symmetric difference of the edge sets. The
edit distance function of a hereditary property, $\mathcal{H}$, is a function of $p \in [0, 1]$
and is the limit of the maximum normalized distance between a graph
of density $p$ and $\mathcal{H}$.

This paper uses localization for computing the edit distance function
of various hereditary properties. For any graph $H$, $\mathrm{Forb}(H)$ denotes the
property of not having an induced copy of $H$. We compute the edit
distance function for $\mathrm{Forb}(H)$, where $H$ is any so-called split graph, and
for the graph $H_9$, a graph first used to describe the difficulties in computing
the edit distance function.



1. Introduction 

This paper uses the method of localization, introduced in [12], as a way
to compute edit distance functions. It uses some properties of quadratic
programming, first applied by Marchant and Thomason [11]. Some results
on the edit distance function can be found in a variety of papers [HI [5l [6l [U 
[21 [21 m [lOl [HI [13]. Much of the background to this paper can be found in a 
paper by Balogh and the author [7]. Terminology and proofs of supporting
lemmas that are suppressed here can be found in [12].

1.1. The edit distance function. A hereditary property is a family
of graphs that is closed under isomorphism and the taking of induced
subgraphs. The edit distance function of a hereditary property $\mathcal{H}$, denoted
$\mathrm{ed}_{\mathcal{H}}(p)$, measures the maximum distance of a density-$p$ graph from a
hereditary property. Formally, if
$\mathrm{Dist}(G,\mathcal{H}) = \min\{|E(G) \triangle E(G')| : |V(G')| = n, G' \in \mathcal{H}\}$, then

\[ \mathrm{ed}_{\mathcal{H}}(p) = \lim_{n\to\infty} \max\left\{\mathrm{Dist}(G,\mathcal{H}) : |V(G)| = n, |E(G)| = \left\lfloor p\binom{n}{2}\right\rfloor\right\} \Big/ \binom{n}{2}. \tag{1} \]

In [7], a result of Alon and Stav [1] is generalized to show that the limit in
(1) does indeed exist for nontrivial hereditary properties and, furthermore,
that

\[ \mathrm{ed}_{\mathcal{H}}(p) = \lim_{n\to\infty} \mathrm{Dist}(G(n,p),\mathcal{H}) \Big/ \binom{n}{2}. \]

That is, the maximum edit distance is achieved, asymptotically, by the
Erdős–Rényi random graph $G(n,p)$. For any nontrivial hereditary property
$\mathcal{H}$ (that is, one that is not finite), the function $\mathrm{ed}_{\mathcal{H}}(p)$ is continuous
and concave down. Hence, it achieves its maximum at a point $(p^*_{\mathcal{H}}, d^*_{\mathcal{H}})$.
It should be noted that, for some hereditary properties, $p^*_{\mathcal{H}}$ might be an
interval.
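As a small illustration of the basic quantity, the following Python sketch computes the edit distance between two graphs on the same labeled vertex set as the size of the symmetric difference of their edge sets, together with the normalization by $\binom{n}{2}$. The function names and the example graphs are illustrative assumptions, not notation from the paper.

```python
def normalize(edges):
    """Store each edge as a frozenset so that (u, v) and (v, u) coincide."""
    return {frozenset(e) for e in edges}

def edit_distance(edges_g, edges_h):
    """Size of the symmetric difference of two edge sets on one labeled vertex set."""
    return len(normalize(edges_g) ^ normalize(edges_h))

def normalized_edit_distance(n, edges_g, edges_h):
    """Edit distance divided by C(n, 2), the number of vertex pairs."""
    return edit_distance(edges_g, edges_h) / (n * (n - 1) // 2)

# Hypothetical example: the path 0-1-2-3 and the 4-cycle 0-1-2-3-0
# differ only in the pair {0, 3}.
path = [(0, 1), (1, 2), (2, 3)]
cycle = [(0, 1), (1, 2), (2, 3), (3, 0)]
distance = edit_distance(path, cycle)
```

Computing $\mathrm{Dist}(G,\mathcal{H})$ itself would additionally require minimizing over all graphs in the property, which this sketch does not attempt.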

1.2. Main results. The main results of this paper are Theorem 1 and
Theorem 3.

A split graph is a graph whose vertex set can be partitioned into one
clique and one independent set. If $H$ is a split graph on $h$ vertices with
independence number $\alpha$ and clique number $\omega$, then $\alpha + \omega \in \{h, h + 1\}$. The
value of $(p^*, d^*)$ had been obtained for the claw by Alon and Stav [2] and for
graphs of the form $K_a + E_b$ (an $a$-clique with $b$ isolated vertices) by Balogh
and the author [7].

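Theorem 1. Let $H$ be a split graph that is neither complete nor empty,
with independence number $\alpha$ and clique number $\omega$. Then,

\[ \mathrm{ed}_{\mathrm{Forb}(H)}(p) = \min\left\{\frac{p}{\omega-1}, \frac{1-p}{\alpha-1}\right\}. \tag{2} \]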

It is a trivial result (see, e.g., [12]) that $\mathrm{ed}_{\mathrm{Forb}(K_\omega)}(p) = p/(\omega - 1)$ and
$\mathrm{ed}_{\mathrm{Forb}(\overline{K_\alpha})}(p) = (1 - p)/(\alpha - 1)$. So, we can combine Theorem 1 with the
prior results for which $H$ is either complete or empty.

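Corollary 2. Let $H$ be a split graph with independence number $\alpha$ and clique
number $\omega$. Then,

\[ (p^*_{\mathcal{H}}, d^*_{\mathcal{H}}) = \left(\frac{\omega-1}{\alpha+\omega-2}, \frac{1}{\alpha+\omega-2}\right). \]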
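The formula for split graphs is easy to evaluate exactly. The sketch below, a minimal illustration using exact rational arithmetic, evaluates $\min\{p/(\omega-1), (1-p)/(\alpha-1)\}$ and locates the crossing point of the two linear pieces, which is where the maximum occurs. The function names are assumptions made for this example only.

```python
from fractions import Fraction

def ed_split(p, alpha, omega):
    """min{p/(omega-1), (1-p)/(alpha-1)} for a split graph that is
    neither complete nor empty (so alpha >= 2 and omega >= 2)."""
    if alpha < 2 or omega < 2:
        raise ValueError("formula assumes alpha >= 2 and omega >= 2")
    p = Fraction(p)
    return min(p / (omega - 1), (1 - p) / (alpha - 1))

def peak(alpha, omega):
    """Crossing point of the two linear pieces and the value attained there."""
    p_star = Fraction(omega - 1, alpha + omega - 2)
    return p_star, ed_split(p_star, alpha, omega)

# The claw K_{1,3} has independence number 3 and clique number 2.
claw_peak = peak(3, 2)
```

For the claw this gives the point $(1/3, 1/3)$; since one piece is increasing and the other decreasing, the crossing point is the maximizer.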




Figure 1. The graph Hg. 
The graph $H_9$, as drawn in Figure 1, was given in [7] as an example of a
hereditary property $\mathcal{H} = \mathrm{Forb}(H_9)$ such that the maximum value of $\mathrm{ed}_{\mathcal{H}}(p)$
cannot be determined by CRGs that only have gray edges. In [7], only an
upper bound of $\min\{p/3, p/(2+2p)\}$ was provided for $\mathrm{ed}_{\mathrm{Forb}(H_9)}(p)$. Here we
determine the function itself.



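Theorem 3. Let $H_9$ be the graph in Figure 1. Then,

\[ \mathrm{ed}_{\mathrm{Forb}(H_9)}(p) = \min\left\{\frac{p}{3}, \frac{p}{1+4p}, \frac{1-p}{2}\right\}. \]

Consequently, $\left(p^*_{\mathrm{Forb}(H_9)}, d^*_{\mathrm{Forb}(H_9)}\right) = \left(\frac{1+\sqrt{17}}{8}, \frac{7-\sqrt{17}}{16}\right)$.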
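The location of the maximum can be checked numerically. The sketch below evaluates $\min\{p/3, p/(1+4p), (1-p)/2\}$ in floating point and compares a coarse grid maximum with the point where $p/(1+4p) = (1-p)/2$, that is, the positive root of $4p^2 - p - 1 = 0$. The function name and grid resolution are illustrative assumptions.

```python
import math

def ed_h9(p):
    """Evaluate min{p/3, p/(1+4p), (1-p)/2} at a point p in [0, 1]."""
    return min(p / 3, p / (1 + 4 * p), (1 - p) / 2)

# Positive root of 4p^2 - p - 1 = 0 and the corresponding value (1 - p)/2.
p_star = (1 + math.sqrt(17)) / 8
d_star = (7 - math.sqrt(17)) / 16

# Coarse grid search as an independent sanity check of the maximum.
grid_max = max(ed_h9(i / 10000) for i in range(10001))
```

The first two pieces coincide at $p = 1/2$, so the piece $p/3$ governs $[0, 1/2]$, the piece $p/(1+4p)$ governs $[1/2, p^*]$ and $(1-p)/2$ governs $[p^*, 1]$.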
Figure 2. Plot of $\mathrm{ed}_{\mathrm{Forb}(H_9)}(p) = \min\{p/3, p/(1 + 4p), (1 - p)/2\}$. The point $(p^*, d^*) = \left(\frac{1+\sqrt{17}}{8}, \frac{7-\sqrt{17}}{16}\right)$ is indicated.



The rest of the paper is organized as follows: Section 2 gives some of the
general definitions for the edit distance function, such as colored regularity
graphs. Section 3 defines and categorizes so-called p-core colored regularity
graphs introduced by Marchant and Thomason [11]. Section 4 describes
the method we use (localization), which is similar to the symmetrization
idea of Sidorenko. Section 5 proves Theorem 1 regarding split graphs.
Section 6 proves Theorem 3 regarding the graph $H_9$. Section 7 is a section
of acknowledgements.



2. Background and basic facts 

2.1. Notation. All graphs are simple. If S and T are sets, then S + T 
denotes the disjoint union of S and T. If v and w are adjacent vertices in a 
graph, we denote the edge between them to be vw. 



CRGs are defined shortly in Section 2 
2.2. Colored regularity graphs. A colored regularity graph (CRG), $K$,
is a simple complete graph, together with a partition of the vertices
into black and white, $V(K) = \mathrm{VW}(K) + \mathrm{VB}(K)$, and a partition of the
edges into black, white and gray, $E(K) = \mathrm{EW}(K) + \mathrm{EG}(K) + \mathrm{EB}(K)$. We
say that a graph $H$ embeds in $K$ (writing $H \mapsto K$) if there is a function
$\varphi : V(H) \to V(K)$ so that if $h_1h_2 \in E(H)$, then either $\varphi(h_1) = \varphi(h_2) \in \mathrm{VB}(K)$
or $\varphi(h_1)\varphi(h_2) \in \mathrm{EB}(K) \cup \mathrm{EG}(K)$, and if $h_1h_2 \notin E(H)$, then either
$\varphi(h_1) = \varphi(h_2) \in \mathrm{VW}(K)$ or $\varphi(h_1)\varphi(h_2) \in \mathrm{EW}(K) \cup \mathrm{EG}(K)$.

For a hereditary property of graphs, $\mathcal{H}$, we denote $\mathcal{K}(\mathcal{H})$ to be the subset
of CRGs such that no forbidden graph maps into $K$. That is, if $\mathcal{F}(\mathcal{H})$ is
defined to be the minimal set of graphs so that $\mathcal{H} = \bigcap_{H \in \mathcal{F}(\mathcal{H})} \mathrm{Forb}(H)$, then
$\mathcal{K}(\mathcal{H}) = \{K : H \not\mapsto K, \ \forall H \in \mathcal{F}(\mathcal{H})\}$. A CRG $K'$ is said to be a sub-CRG
of $K$ if $K'$ can be obtained by deleting vertices of $K$.
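The embedding condition can be tested by brute force on small examples. The following sketch is only an illustration under assumed data structures: a CRG is given by a dictionary of vertex colors ("W" or "B") and a dictionary of edge colors ("W", "G" or "B") keyed by frozensets, and every map from $V(H)$ to $V(K)$ is tried. The helper `all_gray_crg` builds a CRG with all edges gray; all names are hypothetical.

```python
from itertools import combinations, product

def _allowed(u, v, adjacent, vertex_color, edge_color):
    """Check one pair of H-vertices mapped to u and v (possibly u == v)."""
    color = vertex_color[u] if u == v else edge_color[frozenset((u, v))]
    # An edge of H needs black or gray; a non-edge needs white or gray.
    return color == "G" or color == ("B" if adjacent else "W")

def embeds(h_vertices, h_edges, vertex_color, edge_color):
    """Exhaustively search for a map V(H) -> V(K) satisfying the embedding rules."""
    h_vertices = list(h_vertices)
    h_edge_set = {frozenset(e) for e in h_edges}
    pairs = list(combinations(h_vertices, 2))
    for images in product(list(vertex_color), repeat=len(h_vertices)):
        phi = dict(zip(h_vertices, images))
        if all(_allowed(phi[a], phi[b], frozenset((a, b)) in h_edge_set,
                        vertex_color, edge_color) for a, b in pairs):
            return True
    return False

def all_gray_crg(w, b):
    """CRG with w white vertices, b black vertices and every edge gray."""
    colors = {("w", i): "W" for i in range(w)}
    colors.update({("b", i): "B" for i in range(b)})
    verts = list(colors)
    edges = {frozenset(pair): "G" for pair in combinations(verts, 2)}
    return colors, edges

# Hypothetical test graph: the claw with center 0 and leaves 1, 2, 3.
claw_vertices = [0, 1, 2, 3]
claw_edges = [(0, 1), (0, 2), (0, 3)]
```

The search is exponential in $|V(H)|$, so it is only meant for checking small cases by hand, for example that the claw embeds in the all-gray CRG with one white and one black vertex but not in the one with two black vertices.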

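2.3. The f and g functions. For every CRG, $K$, we associate two
functions. The function $f$ is a linear function of $p$ and $g$ is found by weighting
the vertices. Let $K$ have a total of $k$ vertices $\{v_1, \ldots, v_k\}$, and let $M_K(p)$
be a matrix such that the entries are:

\[ [M_K(p)]_{ij} = \begin{cases} p, & \text{if } v_iv_j \in \mathrm{VW}(K) \cup \mathrm{EW}(K); \\ 1-p, & \text{if } v_iv_j \in \mathrm{VB}(K) \cup \mathrm{EB}(K); \\ 0, & \text{if } v_iv_j \in \mathrm{EG}(K). \end{cases} \]

Then, we can express the $f$ and $g$ functions over the domain $p \in [0, 1]$ as
follows, with $\mathrm{VW} = \mathrm{VW}(K)$, $\mathrm{VB} = \mathrm{VB}(K)$, $\mathrm{EW} = \mathrm{EW}(K)$ and $\mathrm{EB} = \mathrm{EB}(K)$:

\[ f_K(p) = \frac{1}{k^2}\left[p\left(|\mathrm{VW}| + 2|\mathrm{EW}|\right) + (1-p)\left(|\mathrm{VB}| + 2|\mathrm{EB}|\right)\right] \tag{3} \]

\[ g_K(p) = \begin{cases} \min & \mathbf{x}^T M_K(p)\, \mathbf{x} \\ \text{s.t.} & \mathbf{x}^T \mathbf{1} = 1 \\ & \mathbf{x} \geq \mathbf{0}. \end{cases} \tag{4} \]

If we denote $\mathbf{1}$ to be the vector of all ones, then $f_K(p) = \left(\frac{1}{k}\mathbf{1}\right)^T M_K(p) \left(\frac{1}{k}\mathbf{1}\right)$.
So, $f_K(p) \geq g_K(p)$.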
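For very small CRGs the two functions can be evaluated directly from the matrix. The sketch below builds the matrix with entries $p$, $1-p$ and $0$ as described above, evaluates $f$ at the uniform weighting, and estimates $g$ by exhaustive search over weight vectors whose entries are multiples of $1/\text{steps}$. The grid search is an assumption of this example (it only gives an upper estimate of $g$ unless the optimum happens to lie on the grid), and the three-vertex CRG used as input is a hypothetical test case: one white vertex and two black vertices joined by a white edge, with the other two edges gray.

```python
from fractions import Fraction
from itertools import combinations

def build_matrix(vertex_color, edge_color, p):
    """Matrix with p for white, 1-p for black and 0 for gray entries."""
    verts = list(vertex_color)
    weight = {"W": Fraction(p), "B": 1 - Fraction(p), "G": Fraction(0)}
    k = len(verts)
    m = [[Fraction(0)] * k for _ in range(k)]
    for i, u in enumerate(verts):
        m[i][i] = weight[vertex_color[u]]
        for j in range(i + 1, k):
            m[i][j] = m[j][i] = weight[edge_color[frozenset((u, verts[j]))]]
    return m

def quad(m, x):
    """Quadratic form x^T M x."""
    k = len(x)
    return sum(x[i] * m[i][j] * x[j] for i in range(k) for j in range(k))

def f_value(m):
    """Value of the quadratic form at the uniform weight vector."""
    k = len(m)
    return quad(m, [Fraction(1, k)] * k)

def g_grid(m, steps=60):
    """Smallest quadratic form over weight vectors with entries in (1/steps)Z."""
    k = len(m)
    best = None
    # Stars and bars: choose k-1 bar positions among steps+k-1 slots.
    for bars in combinations(range(steps + k - 1), k - 1):
        parts, prev = [], -1
        for b in bars + (steps + k - 1,):
            parts.append(b - prev - 1)
            prev = b
        value = quad(m, [Fraction(c, steps) for c in parts])
        if best is None or value < best:
            best = value
    return best

colors = {"w": "W", "b1": "B", "b2": "B"}
edges = {frozenset(("b1", "b2")): "W",
         frozenset(("w", "b1")): "G",
         frozenset(("w", "b2")): "G"}
m = build_matrix(colors, edges, Fraction(1, 4))
```

At $p = 1/4$ the uniform weighting gives $1/4$, while the weighting $(2/3, 1/6, 1/6)$, which lies on the grid for `steps=60`, gives $1/6 = p/(1+2p)$.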



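Theorem 4 ([7]). For any nontrivial hereditary property $\mathcal{H}$,

\[ \mathrm{ed}_{\mathcal{H}}(p) = \inf_{K \in \mathcal{K}(\mathcal{H})} g_K(p) = \inf_{K \in \mathcal{K}(\mathcal{H})} f_K(p). \]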

2.4. Basic observations on $\mathrm{ed}_{\mathcal{H}}(p)$. The following is a summary of basic
facts about the edit distance function. Item (iii) comes from Alon and
Stav [1]. Item (iv) comes from [7].

Theorem 5. Let $\mathcal{H}$ be a nontrivial hereditary property with chromatic
number $\chi$, complementary chromatic number $\overline{\chi}$, binary chromatic number $\chi_B$
and edit distance function $\mathrm{ed}_{\mathcal{H}}(p)$.

(i) If $\chi > 1$, then $\mathrm{ed}_{\mathcal{H}}(p) \leq p/(\chi - 1)$.

(ii) If $\overline{\chi} > 1$, then $\mathrm{ed}_{\mathcal{H}}(p) \leq (1 - p)/(\overline{\chi} - 1)$.

(iii) $\mathrm{ed}_{\mathcal{H}}(1/2) = 1/(2(\chi_B - 1))$.

(iv) $\mathrm{ed}_{\mathcal{H}}(p)$ is continuous and concave down.

(v) $\mathrm{ed}_{\mathcal{H}}(p) = \mathrm{ed}_{\overline{\mathcal{H}}}(1 - p)$.

3. The p-cores
In Marchant and Thomason [11] it is shown that

\[ \mathrm{ed}_{\mathcal{H}}(p) = \inf\{g_K(p) : K \in \mathcal{K}(\mathcal{H})\} = \inf\{f_K(p) : K \in \mathcal{K}(\mathcal{H})\}. \]

Although the setting of that paper is not edit distance, the results can be
translated to our setting. They show, in fact, that $\mathrm{ed}_{\mathcal{H}}(p) = \min\{g_K(p) : K \in \mathcal{K}(\mathcal{H})\}$.
That is, for any hereditary property $\mathcal{H}$ and $p \in [0,1]$, there is a CRG,
$K \in \mathcal{K}(\mathcal{H})$, such that $\mathrm{ed}_{\mathcal{H}}(p) = g_K(p)$. This is found by looking at so-called
p-cores. A CRG, $K$, is a p-core CRG, or simply a p-core, if $g_K(p) < g_{K'}(p)$
for all nontrivial sub-CRGs $K'$ of $K$. Marchant and Thomason prove that

\[ \mathrm{ed}_{\mathcal{H}}(p) = \min\{g_K(p) : K \in \mathcal{K}(\mathcal{H}) \text{ and } K \text{ is p-core}\}. \]

4. Computing edit distance functions using localization

Upper bounds for the edit distance function of $\mathcal{H}$ are found by simply
exhibiting some CRGs $K \in \mathcal{K}(\mathcal{H})$ and computing $g_K(p)$ by means of (4).
The localization method obtains lower bounds for $\mathrm{ed}_{\mathcal{H}}(p)$ and is similar to
symmetrization from Sidorenko. The main tools are Lemmas 6 and 7,
found in [12]. We have already seen much of the theoretical underpinnings.

Given $K$, a p-core, there is a unique optimum weight vector, $\mathbf{x}$, with all
entries positive that is a solution to (4). For any vertex $v \in V(K)$, $d_G(v)$
denotes the sum of the weights of the gray neighbors of $v$ under $\mathbf{x}$, $d_W(v)$
the sum of the weights of the white neighbors (including $v$ itself if its color is white) and
$d_B(v)$ the sum of the weights of the black neighbors (again, including $v$ itself if its color is
black). Consequently, $d_G(v) + d_W(v) + d_B(v) = 1$.

The fundamental concept is that we may, in many cases, assume the 
vertices are monochromatic (say, black) and all edges are either white or 
gray. The sizes of the gray neighborhoods are a function of the weight x{v). 
We formalize the observations below: 

Lemma 6. Let $\mathcal{H}$ be a nontrivial hereditary property and $p \in (0, 1)$, with $\mathcal{K}(\mathcal{H})$
the set of CRGs defined by $\mathcal{H}$. Then,

(i) $\mathrm{ed}_{\mathcal{H}}(p) = \min\{g_K(p) : K \in \mathcal{K}(\mathcal{H}) \text{ and } K \text{ is p-core}\}$.

(ii) If $p \leq 1/2$ and $K$ is a p-core CRG, then $K$ has no black edges and
white edges can only be incident to black vertices.

(iii) If $p \geq 1/2$ and $K$ is a p-core CRG, then $K$ has no white edges and
black edges can only be incident to white vertices.

(iv) If $\mathbf{x}$ is the optimal weight function of a p-core CRG $K$, then for all
$v \in V(K)$, $g_K(p) = p\, d_W(v) + (1 - p)\, d_B(v)$.

The overall idea is that we need only consider p-core CRGs and their
special structure; a great deal of information can then be obtained by focusing
on a single vertex. This is referred to as "localization" because we can focus
on one vertex at a time.

Lemma 7 has all of the elements to express $d_G(v)$ for any vertex $v$ in a
p-core CRG. It is often useful to focus on the gray neighborhood of vertices.

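Lemma 7 (Localization). Let $p \in (0, 1)$ and $K$ be a p-core CRG with
optimal weight function $\mathbf{x}$.

(i) If $p \leq 1/2$, then $\mathbf{x}(v) = g_K(p)/p$ for all $v \in \mathrm{VW}(K)$ and

\[ d_G(v) = \frac{p - g_K(p)}{p} + \frac{1-2p}{p}\,\mathbf{x}(v), \quad \text{for all } v \in \mathrm{VB}(K). \]

(ii) If $p \geq 1/2$, then $\mathbf{x}(v) = g_K(p)/(1 - p)$ for all $v \in \mathrm{VB}(K)$ and

\[ d_G(v) = \frac{1 - p - g_K(p)}{1-p} + \frac{2p-1}{1-p}\,\mathbf{x}(v), \quad \text{for all } v \in \mathrm{VW}(K). \]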
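The localization identity can be sanity-checked on the simplest example. The sketch below assumes the CRG with $b$ black vertices and all edges gray, for which (as an assumption of this example) the optimal weighting is uniform, $\mathbf{x}(v) = 1/b$, with $g = (1-p)/b$; it then compares the gray degree $1 - 1/b$ with the right-hand side $(p-g)/p + \frac{1-2p}{p}\mathbf{x}(v)$ of part (i). The function name is hypothetical.

```python
from fractions import Fraction

def localization_check_all_black(b, p):
    """Return (gray degree, right-hand side of part (i)) for b black vertices,
    all edges gray, under the uniform weighting x(v) = 1/b."""
    p = Fraction(p)
    x = Fraction(1, b)
    g = (1 - p) / b            # only the diagonal entries 1-p contribute
    d_gray = 1 - x             # every other vertex is a gray neighbor
    rhs = (p - g) / p + (1 - 2 * p) / p * x
    return d_gray, rhs

results = [localization_check_all_black(b, p)
           for b in range(2, 6)
           for p in (Fraction(1, 4), Fraction(1, 3), Fraction(1, 2))]
```

In this example the vertex weight also meets the bound $\mathbf{x}(v) \leq g/(1-p)$ with equality, which is the extremal situation that recurs in the proofs below.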



Corollary 8. Let $p \in (0, 1)$ and $K$ be a p-core CRG with optimal weight
function $\mathbf{x}$.

(i) If $p \leq 1/2$, then $\mathbf{x}(v) \leq g_K(p)/(1-p)$ for all $v \in \mathrm{VB}(K)$.

(ii) If $p \geq 1/2$, then $\mathbf{x}(v) \leq g_K(p)/p$ for all $v \in \mathrm{VW}(K)$.

Remark 9. From this point forward in the paper, if $K$ is a CRG under
consideration and $p$ is fixed, $\mathbf{x}(v)$ will denote the weight of $v \in V(K)$ under
the optimal solution of the quadratic program in equation (4) that defines
$g_K$.

One more useful observation is Theorem 6 from [12]:

Theorem 10. A sub-CRG, $K'$, of a CRG, $K$, is a component if, for all
$v \in V(K')$ and all $w \in V(K) - V(K')$, the edge $vw$ is gray. Let $K$ be a CRG
with components $K^{(1)}, \ldots, K^{(\ell)}$. Then

\[ \left(g_K(p)\right)^{-1} = \sum_{i=1}^{\ell} \left(g_{K^{(i)}}(p)\right)^{-1}. \]

5. Forb(H), H a split graph

We need to define a special class of graphs. For $\omega \geq 2$ and a nonnegative
integer vector $(\omega; a_0, a_1, \ldots, a_\omega)$, an $(\omega; a_0, a_1, \ldots, a_\omega)$-clique-star is a graph
$G$ such that $V(G)$ is partitioned into $A$ and $W$. The set $A$ induces an
independent set, the set $W = \{w_1, \ldots, w_\omega\}$ induces a clique and, for $i = 1, \ldots, \omega$,
vertex $w_i$ is adjacent to a set of $a_i + 1$ leaves in $A$, and there are $a_0$
independent vertices. Note that this implies that $\sum_{i=0}^{\omega} a_i = \alpha - \omega$.

Colloquially, a clique-star can be partitioned into stars and independent 
sets such that the centers of the stars are connected by a clique and there 



We get the notation from Hung, Sysło, Weaver and West [9]. Barrett, Jepsen, Lang,
McHenry, Nelson and Owens [8] define a clique-star, but it is a different type of graph.
are no other edges. (If one of the stars is $K_2$, one of the endvertices is
designated to be the center.) Proving that Theorem 1 is true is much more
difficult in the case where either $H$ or its complement is a clique-star.



5.1. Proof of Theorem 1. Note that, because $H$ is neither complete nor
empty, $\alpha, \omega \geq 2$. Without loss of generality, we may assume that $\omega \leq \alpha$.

Fact 11. For any split graph $H$, its chromatic number is its clique number.
In notation, $\chi(H) = \omega(H)$. Consequently, $\chi(\overline{H}) = \alpha(H)$.

Proof. Trivially, $\chi(H) \geq \omega(H)$. Let $W \subseteq V(H)$ be a maximum-sized clique
in $H$. For every $a \in V(H) - W$, there is some vertex $w \in W$ such that $a$ is
nonadjacent to $w$. Thus, $w$ and its nonneighbors form an independent set
and the union of such independent sets covers $V(H)$. So, $\chi(H) = |W| = \omega$.
Because $\overline{H}$ is also a split graph, $\chi(\overline{H}) = \omega(\overline{H}) = \alpha(H)$. □



Let $K(w, b)$ denote the CRG with $w$ white vertices, $b$ black vertices and
all edges gray. An immediate consequence of Fact 11 is that $H$ cannot be
embedded into $K(\omega - 1, 0)$ and $K(0, \alpha - 1)$ and so

\[ \mathrm{ed}_{\mathrm{Forb}(H)}(p) \leq \min\left\{g_{K(\omega-1,0)}(p), g_{K(0,\alpha-1)}(p)\right\} = \min\left\{\frac{p}{\omega-1}, \frac{1-p}{\alpha-1}\right\}. \]



Let $K \in \mathcal{K}(\mathrm{Forb}(H))$ be a p-core CRG and denote $g = g_K(p)$. By
Lemma 6, any edge between vertices of different colors must be gray. Since
$H$ is a split graph, $H$ would embed into any $K$ with a pair of different-colored
vertices. So, the vertices in $K$ must be monochromatic. Furthermore, if $K$
has only gray edges, then either $K$ has at most $\omega - 1$ white vertices or at
most $\alpha - 1$ black vertices. In particular, if $p = 1/2$, then all edges must be
gray and so $\mathrm{ed}_{\mathrm{Forb}(H)}(1/2) = \min\left\{\frac{1/2}{\omega-1}, \frac{1/2}{\alpha-1}\right\}$. Because we have assumed
that $\omega \leq \alpha$, and concavity comes from Theorem 5(iv), it is the case that

\[ \mathrm{ed}_{\mathrm{Forb}(H)}(p) = \frac{1-p}{\alpha-1} \quad \text{for } p \in [1/2, 1]. \]

If $p < 1/2$ and $K$ has white vertices, then Lemma 6(ii) gives that all edges
must be gray. In that case, $g_K(p) \geq \frac{p}{\omega-1}$. So, we may assume that $p < 1/2$
and $K$ has only black vertices and only white or gray edges.

By virtue of the fact that a clique and independent set can intersect in at
most one vertex, $h \leq \alpha + \omega \leq h + 1$. This yields two cases.



Case 1. $\alpha + \omega = h + 1$.

Let $v$ be a largest-weight vertex and let $x = \mathbf{x}(v)$. If $v$ has $h - \omega = \alpha - 1$
gray neighbors, then $H \mapsto K$.



8 RYAN MARTIN 

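Thus, because $x$ is the largest weight, Lemma 7(i) gives that

\begin{align*}
d_G(v) &\leq (\alpha - 2)x \\
\frac{p-g}{p} + \frac{1-2p}{p}\,x &\leq (\alpha - 2)x \\
p - g &\leq (p\alpha - 1)x.
\end{align*}

If $p < 1/\alpha$, then $g > p \geq p/(\omega - 1)$. If $p \geq 1/\alpha$, then Corollary 8(i) gives
that

\begin{align*}
p - g &\leq (p\alpha - 1)\frac{g}{1-p} \\
p(1-p) &\leq g\,p(\alpha - 1) \\
\frac{1-p}{\alpha-1} &\leq g.
\end{align*}

This concludes Case 1.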
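Case 2. $\alpha + \omega = h$.

Let $p \in \left[0, \frac{\omega-1}{h-1}\right)$. Let $v$ be a vertex of largest weight $x = \mathbf{x}(v)$. Lemma 7(i)
gives that

\begin{align*}
d_G(v) &\leq (h - \omega - 1)x \\
\frac{p-g}{p} + \frac{1-2p}{p}\,x &\leq (\alpha - 1)x \\
p - g &\leq \left(p(\alpha + 1) - 1\right)x.
\end{align*}

If $p < 1/(\alpha + 1)$, then $g > p \geq p/(\omega - 1)$. If $p \geq 1/(\alpha + 1)$, then Corollary 8(i)
gives that

\[ p - g \leq \left(p(\alpha + 1) - 1\right)\frac{g}{1-p}. \]

Then,

\[ g \geq \frac{1-p}{\alpha} > \frac{1 - \frac{\omega-1}{h-1}}{\alpha} = \frac{1}{h-1} = \frac{\frac{\omega-1}{h-1}}{\omega-1} > \frac{p}{\omega-1}. \]

Finally, we may assume that $p \in \left[\frac{\omega-1}{h-1}, \frac{1}{2}\right]$. We have to split into two cases
according to the structure of $H$.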

Case 2a. $\alpha + \omega = h$ and there exists a $c \leq \omega - 1$ such that $H$ can be
partitioned into $c$ cliques and an independent set of $\alpha - c$ vertices.

Because all vertices are black and all edges are either white or gray in $K$,
there cannot be $\alpha$ vertices such that $\omega - 1$ of them are adjacent
to all of the $\alpha - 1$ others via a gray edge. We will show that this condition
requires that $g \geq \min\left\{\frac{p}{\omega-1}, \frac{1-p}{\alpha-1}\right\}$.
First, we show that if $g$ is small then $K$ must have a gray $(\omega - 1)$-clique.
Let $v_1, \ldots, v_\ell$ be a maximal gray clique. That is, any edge between these
vertices is gray and every vertex not in $\{v_1, \ldots, v_\ell\}$ has at least one white
neighbor in $\{v_1, \ldots, v_\ell\}$. Let $x_i = \mathbf{x}(v_i)$ for $i = 1, \ldots, \ell$ and let $X = \sum_{i=1}^{\ell} x_i$.

Using Lemma 7(i) and maximality,

\begin{align*}
\sum_{i=1}^{\ell} \left[d_G(v_i) - X + x_i\right] &\leq (\ell - 1)(1 - X) \\
\ell\,\frac{p-g}{p} + \frac{1-2p}{p}\,X - (\ell - 1)X &\leq (\ell - 1)(1 - X) \\
p - \ell g &\leq (2p - 1)X.
\end{align*}

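Hence, either $g \geq p/(\omega - 1)$ or $K$ has a gray $(\omega - 1)$-clique. We may
thus suppose that $K$ has a gray $(\omega - 1)$-clique. Let one with maximum
total weight be $\{v_1, \ldots, v_{\omega-1}\}$ with $x_i = \mathbf{x}(v_i)$ for $i = 1, \ldots, \omega - 1$ and
$X = \sum_{i=1}^{\omega-1} x_i$. Further, let $Y$ be the sum of the weights of the common
gray neighbors of $v_1, \ldots, v_{\omega-1}$. Since $X$ has the largest weight of any gray
$(\omega - 1)$-clique and $Y$ has at most $\alpha - \omega$ vertices,

\[ Y \leq (\alpha - \omega)\frac{X}{\omega - 1}. \]

Therefore,

\begin{align*}
\sum_{i=1}^{\omega-1} \left[d_G(v_i) - X + x_i\right] &\leq (\omega - 1)Y + (\omega - 2)(1 - X - Y) \\
(\omega - 1)\left(\frac{p-g}{p} - X\right) + \frac{1-p}{p}\,X &\leq Y + (\omega - 2)(1 - X) \\
(1 - X) - (\omega - 1)\frac{g}{p} + \frac{1-p}{p}\,X &\leq (\alpha - \omega)\frac{X}{\omega - 1}
\end{align*}

\[ \left(\frac{1}{p} - \frac{h-2}{\omega-1}\right)X \leq \frac{\omega-1}{p}\,g - 1. \tag{5} \]

If $p < (\omega - 1)/(h - 2)$, then the term in parentheses in (5) is positive and
$g > p/(\omega - 1)$. If $p \geq (\omega - 1)/(h - 2)$, then $X \leq (\omega - 1)\frac{g}{1-p}$ by Corollary 8(i).
Hence,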

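\begin{align*}
\frac{g}{1-p}\left(\frac{1}{p} - \frac{h-2}{\omega-1}\right) &\leq \frac{\omega-1}{p}\,g - 1 \\
1 &\leq g\left[\frac{\omega-1}{p} - \frac{1}{p(1-p)} + \frac{h-2}{(\omega-1)(1-p)}\right] \\
  &= g\left[\frac{\alpha-1}{1-p} + \frac{\omega-2}{\omega-1}\cdot\frac{(\omega-1) - p(h-2)}{p(1-p)}\right].
\end{align*}

Therefore, $g \geq (1 - p)/(\alpha - 1)$. This concludes Case 2a.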
Which graphs are in Case 2, but not Case 2a? Since $\alpha + \omega = h$, every
$w \in W$ has at least one neighbor in $A$. If any $a \in A$ has more than one
neighbor in $W$, then we can greedily find at most $\omega - 1$ vertices in $A$ such
that the union of their neighborhoods is $W$. Such a graph would be in Case
2a.

So, the graphs $H$ with $\omega \leq \alpha$ that are in neither Case 1 nor Case 2a have
the property that $N(w) \cap N(w') \cap A = \emptyset$ for all distinct $w, w' \in W$. This is
exactly the case of a clique-star.

Case 2b. $\alpha + \omega = h$ and $H$ is a clique-star.

Let $W = \{w_1, \ldots, w_\omega\}$ such that $w_i$ has $a_i + 1$ neighbors in $A$ for
$i = 1, \ldots, \omega$ and there are $a_0$ isolated vertices.

Fact 12. If $\omega \geq 2$ and $H$ is an $(\omega; a_0, \ldots, a_\omega)$-clique-star and $K$ is a
black-vertex CRG such that either

• there exists a vertex with at least $\alpha$ gray neighbors, or

• there exist vertices $v_1, \ldots, v_\omega$ such that

— $\{v_1, \ldots, v_\omega\}$ is a gray clique,

— for $i = 1, \ldots, \omega - 1$, $v_i$ has $\alpha - 1$ gray neighbors, and

— $v_\omega$ has at least $\lfloor(\alpha - \omega)/\omega\rfloor + \omega - 1$ gray neighbors (including
$v_1, \ldots, v_{\omega-1}$).

Then, $H \mapsto K$.

Proof of Fact 12. If $K$ has a vertex, $v$, with $\alpha$ gray neighbors, then $W$ can
be mapped to $v$ whereas each member of $A = V(H) - W$ can be mapped
to a different gray neighbor of $v$. Thus $H \mapsto K$. So, we may assume the
maximum gray degree of $K$ is at most $\alpha - 1$.

Without loss of generality, let $a_1 \geq \cdots \geq a_\omega$. Our mapping is done
recursively: Map $w_\omega$ and one of its neighbors to $v_\omega$. Map its remaining
$A$-neighbors ($a_\omega \leq \lfloor(\alpha - \omega)/\omega\rfloor$ of them) to each of the gray neighbors of $v_\omega$
that are not in $\{v_1, \ldots, v_{\omega-1}\}$.

Having embedded $w_\omega, \ldots, w_{i+1}$ and each of their respective $A$-neighbors
into a total of at most $\sum_{j=i+1}^{\omega}(a_j + 1)$ vertices of $K$, we map $w_i$ and one of its
$A$-neighbors into $v_i$ and its remaining $a_i$ $A$-neighbors into arbitrary unused
gray neighbors of $v_i$. After $w_1$ and its neighbors are mapped, we map the
remaining $a_0$ isolated vertices arbitrarily into the vertices of $K$ that were
not already used.

This mapping can be accomplished because the fact that each of the $v_i$
has at least $\alpha - 1$ gray neighbors ensures that, even at the last step, when
$w_1$ and a neighbor are embedded, there are at least $\alpha - 1$ gray neighbors of $v_1$.
The number of gray neighbors of $v_1$ that were used are the $\omega - 1$ vertices $v_i$
and at most $\sum_{j=2}^{\omega} a_j = \alpha - \omega - a_1 - a_0$ others, for a total of $\alpha - 1 - a_1 - a_0$.
So, there are enough gray neighbors of $v_1$ to embed the $a_1$ neighbors of $w_1$
as well as the isolated vertices. Thus, $H \mapsto K$. □
Fact 13. Let $p \in (0, 1/2)$ and let $K$ be a black-vertex CRG. If $g_K(p) \leq
\min\{p/(\omega - 1), (1 - p)/(\alpha - 1)\}$, then either

• $K \approx K(0, \alpha - 1)$, or

• there exists a vertex with at least $\alpha$ gray neighbors, or

• there exist vertices $v_1, \ldots, v_\omega$ such that

— $\{v_1, \ldots, v_\omega\}$ is a gray clique,

— for $i = 1, \ldots, \omega - 1$, $v_i$ has $\alpha - 1$ gray neighbors, and

— $v_\omega$ has at least $\lfloor(\alpha - \omega)/\omega\rfloor + \omega - 1$ gray neighbors (including
$v_1, \ldots, v_{\omega-1}$).



Proof of Fact 13. Assume that no vertex has $\alpha$ gray neighbors. We find
$v_1, \ldots, v_\omega$ greedily. Choose $v_1$ to be a vertex of largest weight. Stop if $i = \omega$
or if $N_G(v_1) \cap \cdots \cap N_G(v_i)$ is empty. Otherwise, let $v_{i+1}$ be a vertex of largest
weight in that set. We will show later that this process creates at least $\omega$
vertices.

First, we find the number of gray neighbors of $v_1$, using the fact that $x_1$
is the largest weight.

\[ |N_G(v_1)| \geq \frac{d_G(v_1)}{x_1} \geq \frac{p-g}{p\,x_1} + \frac{1-2p}{p}. \]

Using Corollary 8(i), we have that $x_1 \leq g/(1 - p)$ and so

\[ |N_G(v_1)| \geq \frac{(p-g)(1-p)}{pg} + \frac{1-2p}{p} = \frac{1-p-g}{g} \geq \alpha - 2. \]

Equality only occurs if $g = (1 - p)/(\alpha - 1)$ and there are $\alpha - 1$ vertices, all of
weight $1/(\alpha - 1)$; thus $K \approx K(0, \alpha - 1)$. So, we may assume $|N_G(v_1)| \geq \alpha - 1$.

For $i \in \{2, \ldots, \omega - 1\}$, we let $X = \sum_{j=1}^{i} x_j$ and consider the common
gray neighborhood of $\{v_1, \ldots, v_i\}$. Its total weight is at least:

\[ d_G(v_i) - (X - x_i) - \sum_{j=1}^{i-1} \mathbf{x}\left(N_W(v_j)\right) = \frac{p - ig}{p} + \frac{1-2p}{p}\,X > 0, \tag{6} \]

because $i \leq \omega - 1$, $g \leq p/(\omega - 1)$, $p < 1/2$ and $X \geq x_1 > 0$. Thus, $v_{i+1}$ must
exist.

We use these calculations to obtain the size of $N_G(v_i)$ for $i = 2, \ldots, \omega - 1$.
First note that $v_i$ has $i - 1$ gray neighbors among $\{v_1, \ldots, v_{i-1}\}$ and
that every vertex that is a gray neighbor of each of $v_1, \ldots, v_i$ has weight at
most $x_i$. As to the remaining vertices, partition the vertices in
$N_G(v_i) - \{v_1, \ldots, v_{i-1}\} - \bigcap_{j=1}^{i} N_G(v_j)$ according to the least index $j$ for which the
vertex is adjacent to $v_j$ via a white edge. By the choice of $v_1, \ldots, v_i$, such a
vertex has weight at most $x_j = \mathbf{x}(v_j)$. Consequently, we have a lower bound
for $|N_G(v_i)|$:

\[ |N_G(v_i)| \geq (i-1) + \left\lceil \frac{\mathbf{x}\left(N_G(v_1) \cap \cdots \cap N_G(v_i)\right)}{x_i} \right\rceil + \sum_{j=1}^{i-1} \left\lceil \frac{\mathbf{x}\left(N_G(v_i) \cap N_W(v_j) \cap \{N_G(v_1) \cap \cdots \cap N_G(v_{j-1})\}\right)}{x_j} \right\rceil. \]

We can drop the ceilings to obtain the lower bound

\[ |N_G(v_i)| \geq (i-1) + \frac{1}{x_i}\,\mathbf{x}\left(N_G(v_1) \cap \cdots \cap N_G(v_i)\right) + \sum_{j=1}^{i-1} \frac{1}{x_j}\,\mathbf{x}\left(N_G(v_i) \cap N_W(v_j) \cap \{N_G(v_1) \cap \cdots \cap N_G(v_{j-1})\}\right). \]

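Now we look at the coefficients $\frac{1}{x_1} \leq \frac{1}{x_2} \leq \cdots \leq \frac{1}{x_i}$. The total weight of
gray neighbors with coefficient $\frac{1}{x_1}$ is at most $\mathbf{x}(N_W(v_1))$. The total weight of
gray neighbors with coefficient $\frac{1}{x_1}$ or $\frac{1}{x_2}$ is at most $\mathbf{x}(N_W(v_1)) + \mathbf{x}(N_W(v_2))$
and so on. Therefore, we may further lower bound $|N_G(v_i)|$ as follows:

\[ |N_G(v_i)| \geq (i-1) + \sum_{j=1}^{i-1} \frac{1}{x_j}\,\mathbf{x}\left(N_W(v_j)\right) + \frac{\mathbf{x}\left(N_G(v_i)\right) - (X - x_i) - \sum_{j=1}^{i-1} \mathbf{x}\left(N_W(v_j)\right)}{x_i} \]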



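and observe that inequality (6) shows that the last numerator is nonnegative.
Using similar computations as before, the previous inequality simplifies
to

\[ |N_G(v_i)| \geq (i-1) + \sum_{j=1}^{i-1} \frac{1}{x_j}\left(\frac{g}{p} - \frac{1-p}{p}\,x_j\right) + \frac{1}{x_i}\left(\frac{p - ig}{p} + \frac{1-2p}{p}\,X\right). \]

After some simplification,

\[ |N_G(v_i)| \geq \frac{g}{p}\sum_{j=1}^{i} \frac{1}{x_j} + \frac{p - (i+1)g}{p\,x_i} + \frac{1-2p}{p}\left(\frac{X}{x_i} - (i-1)\right). \tag{7} \]

Using Jensen's inequality, we see that

\[ \sum_{j=1}^{i} \frac{1}{x_j} \geq \frac{i^2}{X}. \]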
So, we return to (7) and subsequently use the fact that $x_i \leq X/i$:

\begin{align*}
|N_G(v_i)| &\geq \frac{g\,i^2}{pX} - \frac{1-2p}{p}(i-1) + \frac{p - (i+1)g}{p\,x_i} + \frac{1-2p}{p}\left(\frac{X}{x_i}\right) \\
&\geq \frac{g\,i^2}{pX} - \frac{1-2p}{p}(i-1) + \frac{p - (i+1)g + (1-2p)X}{p\,(X/i)} \\
&= \frac{i(p-g)}{pX} + \frac{1-2p}{p}.
\end{align*}

Using the fact that $X \leq ig/(1 - p)$ from Corollary 8(i), we see that

\[ |N_G(v_i)| \geq \frac{1-p-g}{g} \geq \alpha - 2. \]

Equality only occurs if $g = (1 - p)/(\alpha - 1)$ and $K \approx K(0, \alpha - 1)$. So, we
may assume $|N_G(v_i)| \geq \alpha - 1$ for $i = 1, \ldots, \omega - 1$.

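Finally, we try to determine the number of vertices adjacent to $v_\omega$ via a
gray edge. We only need $|N_G(v_\omega)| \geq \lfloor\alpha/\omega\rfloor + \omega - 2$ in order to finish the
proof. First, note that the very existence of $v_\omega$ ensures that $|N_G(v_\omega)| \geq \omega - 1$.
Thus, we may assume that $\alpha \geq 2\omega$.

Second, suppose that $\omega \geq 3$. Recalling that $d_G(v_\omega) = \frac{p-g}{p} + \frac{1-2p}{p}\,x_\omega$ and
that every vertex has weight at most $\frac{g}{1-p}$, the pigeonhole principle gives that for $v = v_\omega$ (indeed, for any
vertex $v$),

\[ |N_G(v)| \geq \left\lceil \frac{p-g}{p}\cdot\frac{1-p}{g} \right\rceil \geq \begin{cases} \left\lceil \frac{(\omega-2)(1-p)}{p} \right\rceil, & \text{if } p \leq \frac{\omega-1}{h-2}; \\[4pt] \left\lceil (\alpha-1) - \frac{1-p}{p} \right\rceil, & \text{if } p \geq \frac{\omega-1}{h-2} \end{cases} \geq \left\lceil \frac{(\alpha-1)(\omega-2)}{\omega-1} \right\rceil. \]

Let $\alpha = q\omega + r$ with $0 \leq r \leq \omega - 1$ and note that since $\alpha \geq 2\omega$, $q \geq 2$.
Hence,

\begin{align*}
|N_G(v)| &\geq \left\lceil \frac{(\alpha-1)(\omega-2)}{\omega-1} \right\rceil \\
&= q(\omega - 2) + \left\lceil \frac{(q + r - 1)(\omega-2)}{\omega-1} \right\rceil \\
&\geq q(\omega - 2) + 1 \\
&= q + \omega - 2 + (q - 1)(\omega - 3).
\end{align*}

Since $q = \lfloor\alpha/\omega\rfloor$, we may conclude that $|N_G(v)| \geq \lfloor\alpha/\omega\rfloor + \omega - 2$, just as
desired.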
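The integer arithmetic in this step is easy to confirm by exhaustive search over a finite range. The sketch below checks that $\lceil(\alpha-1)(\omega-2)/(\omega-1)\rceil \geq \lfloor\alpha/\omega\rfloor + \omega - 2$ whenever $\omega \geq 3$ and $\alpha \geq 2\omega$, up to bounds chosen arbitrarily for this example; it is a sanity check, not a proof.

```python
def ceil_div(a, b):
    """Ceiling of a / b for integers with b > 0."""
    return -(-a // b)

def first_counterexample(max_omega=15, max_alpha=400):
    """Return the first (alpha, omega) violating the inequality, or None."""
    for omega in range(3, max_omega + 1):
        for alpha in range(2 * omega, max_alpha + 1):
            lower = ceil_div((alpha - 1) * (omega - 2), omega - 1)
            target = alpha // omega + omega - 2
            if lower < target:
                return alpha, omega
    return None

counterexample = first_counterexample()
```

The search finds no violation in this range, consistent with the chain of (in)equalities written in terms of the quotient $q$ and remainder $r$.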

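Third, let $\omega = 2$; i.e., $H$ is a double-star (possibly with isolated vertices).
Recall that $\alpha \geq 2\omega = 4$. Our goal is to show that $|N_G(v_2)| \geq \lfloor\alpha/\omega\rfloor + \omega - 2 = \lfloor\alpha/2\rfloor$.
The computations are, by now, routine. We use the fact that
$x_2 \geq d_G(v_1)/(\alpha - 1)$ and $x_1 \leq g/(1 - p)$.

\begin{align*}
|N_G(v_2)| &\geq \frac{d_G(v_2)}{x_1} \\
&\geq \frac{1}{x_1}\left(\frac{p-g}{p} + \frac{1-2p}{p}\,x_2\right) \\
&\geq \frac{1}{x_1}\left(\frac{p-g}{p} + \frac{1-2p}{p}\cdot\frac{d_G(v_1)}{\alpha-1}\right) \\
&\geq \frac{p-g}{p\,x_1}\left(1 + \frac{1-2p}{p(\alpha-1)}\right) + \frac{1}{\alpha-1}\left(\frac{1-2p}{p}\right)^2 \\
&\geq \frac{(p-g)(1-p)}{pg}\left(\frac{p(\alpha-3)+1}{p(\alpha-1)}\right) + \frac{1}{\alpha-1}\left(\frac{1-2p}{p}\right)^2.
\end{align*}

Recalling that, in the case of $\omega = 2$, $g \leq \min\{p, (1 - p)/(\alpha - 1)\}$,

\begin{align*}
|N_G(v_2)| &\geq \begin{cases} \frac{1}{\alpha-1}\left(\frac{1-2p}{p}\right)^2, & \text{if } p \leq 1/\alpha; \\[4pt] \frac{p\alpha-1}{p}\left(\frac{p(\alpha-3)+1}{p(\alpha-1)}\right) + \frac{1}{\alpha-1}\left(\frac{1-2p}{p}\right)^2, & \text{if } p \geq 1/\alpha \end{cases} \\
&= \begin{cases} \frac{1}{\alpha-1}\left(\frac{1-2p}{p}\right)^2, & \text{if } p \leq 1/\alpha; \\[4pt] \alpha - 2 - \frac{1-2p}{p(\alpha-1)}, & \text{if } p \geq 1/\alpha. \end{cases}
\end{align*}

In each case, the smallest value of the expression occurs when $p = 1/\alpha$,
giving

\[ |N_G(v_2)| \geq \left\lceil \alpha - 3 + \frac{1}{\alpha-1} \right\rceil = \alpha - 2 = \left\lceil \frac{\alpha}{2} \right\rceil + \left\lfloor \frac{\alpha}{2} \right\rfloor - 2. \]

This is at least $\lfloor\alpha/2\rfloor$ since $\alpha \geq 4$. This concludes the proof of Fact 13. □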

Summarizing, if $H \not\mapsto K$, then either $g \geq p/(\omega - 1)$ or $g \geq (1 - p)/(\alpha - 1)$.
This concludes the proof of Theorem 1.



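5.2. Examples of split graphs. Items (i) and (ii) in Corollary 14 were
proven in [7].

Corollary 14. Let $H$ be a graph on $h$ vertices.

(i) If $H \cong K_a + E_b$, then $\mathrm{ed}_{\mathrm{Forb}(H)}(p) = \min\left\{\frac{p}{a-1}, \frac{1-p}{b}\right\}$.

(ii) If $H$ is a star (i.e., $H \cong E_{h-1} \vee K_1$), then $\mathrm{ed}_{\mathrm{Forb}(H)}(p) = \min\left\{p, \frac{1-p}{h-2}\right\}$.

(iii) If $H$ is a double-star (i.e., there are adjacent vertices $u$ and $v$ to
which every other vertex is adjacent to exactly one), then
$\mathrm{ed}_{\mathrm{Forb}(H)}(p) = \min\left\{p, \frac{1-p}{h-3}\right\}$.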
6. Forb(H_9)

Marchant and Thomason [11] give the example of $\mathcal{H} = \mathrm{Forb}(C_6^*)$, where
$C_6^*$ is a 6-cycle with an additional diagonal edge, such that $\mathrm{ed}_{\mathcal{H}}(p)$ is not
determined by CRGs with all gray edges. More precisely, they prove that

\[ \mathrm{ed}_{\mathrm{Forb}(C_6^*)}(p) = \min\left\{\frac{p}{1+2p}, \frac{1-p}{2}\right\}. \]

The CRG which corresponds to $g_K(p) = (1 - p)/2$ is $K(0, 2)$. The CRG, $K$,
which has $g_K(p) = p/(1 + 2p)$ for $p \in [0, 1/2]$ consists of three vertices: two
black vertices connected via a white edge and a white vertex. The remaining
two edges are gray.

The graph $H_9$, shown in Figure 1 and cited in [7], generates a hereditary
property $\mathcal{H} = \mathrm{Forb}(H_9)$ such that $d^*_{\mathcal{H}}$ cannot be determined by CRGs of the
form $K(a, c)$. Note that $d^*_{\mathrm{Forb}(C_6^*)}$ can be determined by such CRGs, but the
part of the function for $p \in (0, 1/2)$ cannot.

6.1. Proof of Theorem 3. Upper bound. We know that $\chi(H_9) = 4$ so
let $K^{(1)} = K(3, 0)$ where $g_{K^{(1)}}(p) = p/3$. We also know that $\chi(\overline{H_9}) = 3$
so let $K^{(4)} = K(0, 2)$ where $g_{K^{(4)}}(p) = (1 - p)/2$. In [7], another CRG
in $\mathcal{K}(\mathrm{Forb}(H_9))$ is given, call it $K^{(2)}$. It consists of 4 white vertices, one
black edge and 5 gray edges. It has edit distance function
$g_{K^{(2)}}(p) = \min\{p/3, p/(2 + 2p)\}$.

There is a CRG with a smaller $g$ function. We call it $K^{(3)}$; it consists of
5 white vertices, two disjoint black edges and the remaining 8 edges gray.

The function $g_{K^{(3)}}(p)$ can be computed by use of Theorem 10. In the setup
of that theorem, $K^{(3)}$ has 3 components. Since the components have $g$
functions either $p$ (for the solitary white vertex) or $\min\{p, 1/2\}$ (for each of
the other two components), the theorem gives that

\[ g_{K^{(3)}}(p)^{-1} = p^{-1} + 2\left(\min\{p, 1/2\}\right)^{-1} = \left(\min\{p/3, p/(1 + 4p)\}\right)^{-1}. \]
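The reciprocal-sum rule for components makes this computation mechanical. The following sketch combines component values by adding reciprocals, using exact fractions, and compares the result for one solitary white vertex (value $p$) and two black-edge pairs (value $\min\{p, 1/2\}$ each) with the closed form $\min\{p/3, p/(1+4p)\}$. It also treats the CRG with two solitary white vertices and one black-edge pair. The function names are assumptions made for this example.

```python
from fractions import Fraction

def combine_components(component_values):
    """Reciprocal of the sum of reciprocals of the component g-values."""
    return 1 / sum(1 / Fraction(g) for g in component_values)

def g_two_pairs(p):
    """One solitary white vertex and two black-edge pairs of white vertices."""
    p = Fraction(p)
    pair = min(p, Fraction(1, 2))
    return combine_components([p, pair, pair])

def g_one_pair(p):
    """Two solitary white vertices and one black-edge pair of white vertices."""
    p = Fraction(p)
    pair = min(p, Fraction(1, 2))
    return combine_components([p, p, pair])

sample_points = [Fraction(i, 20) for i in range(1, 20)]
```

Both CRGs give $p/3$ on $(0, 1/2]$; they differ only for $p > 1/2$, where the second black edge lowers the value from $p/(2+2p)$ to $p/(1+4p)$.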

It is easy to see that $H_9 \not\mapsto K^{(1)}$ and $H_9 \not\mapsto K^{(4)}$. In [7], it was shown that
$H_9 \not\mapsto K^{(2)}$. To finish the upper bound, it remains to show that $H_9 \not\mapsto K^{(3)}$.
Let $v_0$ be the isolated vertex, $\{v_1, w_1\}$ be a black edge and $\{v_2, w_2\}$ be a
black edge.

First, we show that no component of $K^{(3)}$ can have 4 vertices from $H_9$.
Since there are no independent sets of size 4 and no induced stars of size 4,
the only way to have a component of size 4 is to have an induced copy of $C_4$
in the component consisting of, say, $\{v_2, w_2\}$. It is not difficult to see that
deleting two vertices from the set $\{0, 3, 6\}$ yields a $C_4$-free graph. So, any $C_4$
contains exactly two members of $\{0, 3, 6\}$. Without loss of generality, the
induced $C_4$ is $\{1, 3, 6, 8\}$. But the graph induced by $\{0, 2, 4, 5, 7\}$ is
a $C_5$, which cannot be mapped into the sub-CRG induced by $\{v_0, v_1, w_1\}$.
Therefore, if $H_9$ were to map to $K^{(3)}$, each component must contain exactly
3 vertices. First we map to $v_0$. The only independent sets of size 3 are
$\{1, 4, 7\}$ and $\{2, 5, 8\}$. Without loss of generality, assume the former. Second,
we consider the graph induced by $\{0, 2, 3, 5, 6, 8\}$. Any partition of these
vertices into two subsets of 3 vertices either has a triangle or a copy of $\overline{P_3}$,
neither of which maps into $\{v_1, w_1\}$ or $\{v_2, w_2\}$. So, these six vertices cannot
be mapped into $\{v_1, w_1, v_2, w_2\}$. Hence $H_9 \not\mapsto K^{(3)}$.

The CRGs $K^{(1)}$, $K^{(3)}$ and $K^{(4)}$ give an upper bound of $\min\left\{\frac{p}{3}, \frac{p}{1+4p}, \frac{1-p}{2}\right\}$.

Lower bound, for $p \leq 1/2$. Let $K$ be a p-core such that $H_9 \not\mapsto K$. If $K$ has
at least 2 white vertices, then it has no black vertices because $H_9 \mapsto K(2, 1)$.
(The independent sets are $\{1, 4, 7\}$ and $\{2, 5, 8\}$ and the clique is $\{0, 3, 6\}$.)
So, in this case $g_K(p) \geq p/3$ with equality if and only if $K \approx K(3, 0)$.

If $K$ has exactly one white vertex, then there is no gray edge among the
black vertices because $H_9 \mapsto K(1, 2)$. (The independent set is $\{2, 7\}$ and
the cliques are $\{0, 1, 8\}$ and $\{3, 4, 5, 6\}$.) Let $w$ be the white vertex and
$K' = K - \{w\}$ and $k' = |V(K')|$. Since $K'$ is a clique with all black vertices
and all white edges, Proposition 8 from [12] gives that, for $p \in (0, 1/2)$,
$g_{K'}(p) = p + \frac{1-2p}{k'} > p$. By Theorem 10, $g_K(p) > 1/(1/p + 1/p) = p/2$,
which is strictly larger than $\mathrm{ed}_{\mathrm{Forb}(H_9)}(p)$ for $p \in (0, 1/2]$.

If $K$ has no white vertices, then let $v_0$ be the vertex with largest weight
and let $v_1$ be a vertex in the gray neighborhood of $v_0$. Let $x_0 = \mathbf{x}(v_0)$ and
$x_1 = \mathbf{x}(v_1)$. Since $K$ can have no gray triangles ($H_9$ can be partitioned into
3 cliques), $d_G(v_0) + d_G(v_1) \leq 1$.

\begin{align*}
1 &\geq d_G(v_0) + d_G(v_1) \\
  &\geq 2\,\frac{p-g}{p} + \frac{1-2p}{p}\,(x_0 + x_1) \\
g &\geq \frac{p}{2} + \frac{1-2p}{2}\,(x_0 + x_1) \geq \frac{p}{2}.
\end{align*}

Equality only occurs if $p = 1/2$.

Summarizing, if $p \leq 1/2$ and $K$ is a p-core such that $H_9 \not\mapsto K$, then
$g_K(p) \geq p/3$ with equality only if $K \approx K(3, 0)$.

Lower bound, for $p \geq 1/2$. Let $K$ be a p-core such that $H_9 \not\mapsto K$. If $K$
has at least 2 black vertices, then there are no white vertices because
$H_9 \mapsto K(1, 2)$ and so $g_K(p) \geq (1 - p)/2$ with equality if and only if $K \approx K(0, 2)$.

If $K$ has exactly one black vertex, then there is no gray edge among
the white vertices because $H_9 \mapsto K(2, 1)$. Let $b$ be the black vertex and
$K' = K - \{b\}$ and $k' = |V(K')|$. Similar to the above, Proposition 8 from [12]
can be used to show that, for $p \in (1/2, 1)$, $g_{K'}(p) = 1 - p + \frac{2p-1}{k'} > 1 - p$. By
Theorem 10, $g_K(p) > (1 - p)/2$, which is strictly larger than $\mathrm{ed}_{\mathrm{Forb}(H_9)}(p)$
for $p \in [1/2, 1)$.

From now on, we will assume that $K$ has only white vertices and, since
it is p-core for $p \geq 1/2$, all edges are black or gray. Fact 15 and Fact 16
establish some of the structural theorems.



ON THE COMPUTATION OF EDIT DISTANCE FUNCTIONS 






Fact 15. Let $p \in [1/2, 1)$ and $K$ be a p-core CRG with white vertices and
black or gray edges. Let $v$ and $v'$ be vertices connected by a gray edge. Then,
$N_G(v) \cap N_G(v')$ has at most two vertices.

Proof. If $N_G(v) \cap N_G(v')$ has three vertices, then map $H_9$ vertices 0, 3 and
6 to each of them, map $\{1, 4, 7\}$ to $v$ and $\{2, 5, 8\}$ to $v'$. This is a map
demonstrating that $H_9 \mapsto K$. □

Fact 16. Let $p \in [1/2, 1)$ and $K$ be a p-core CRG with white vertices and
black or gray edges. Let $v_0$ be a vertex of largest weight and $v_1$ be a vertex that
has largest weight among those in $N_G(v_0)$. Then, either $N_G(v_0) \cap N_G(v_1)$
has exactly two vertices or $g_K(p) \geq (1 - p)/2$ or $g_K(p) \geq p/3$ with equality
if and only if $K \approx K(3, 0)$.



Proof. Let $g = g_K(p)$. If the statement of Fact 16 is not true, then $N_G(v_0) \cap 
N_G(v_1)$ has at most one vertex which, by the choice of $v_1$, has weight at most 
$x(v_1)$ and, by inclusion-exclusion, has weight at least $d_G(v_0) + d_G(v_1) - 1$. 
Therefore, 

\[ x(v_1) \ge d_G(v_0) + d_G(v_1) - 1 = 2\,\frac{1-p-g}{1-p} + \frac{2p-1}{1-p}\bigl(x(v_0) + x(v_1)\bigr) - 1, \]

\[ g \ge \frac{1-p}{2} + \frac{2p-1}{2}\,x(v_0) + \frac{3p-2}{2}\,x(v_1). \tag{8} \]

If $p \ge 2/3$, then $g \ge (1-p)/2$. If $p < 2/3$, then use $x(v_1) \le x(v_0)$ in (8). 

\[ g \ge \frac{1-p}{2} + \frac{5p-3}{2}\,x(v_0). \tag{9} \]

If $p \ge 3/5$, then $g \ge (1-p)/2$. If $p < 3/5$, then use the fact that 
Corollary 8(ii) gives $x(v_0) \le g/p$, which we use in (9). 

\[ g \ge \frac{1-p}{2} + \frac{5p-3}{2}\,x(v_0) \ge \frac{1-p}{2} + \frac{5p-3}{2}\left(\frac{g}{p}\right), \qquad\text{which rearranges to}\qquad g \ge \frac{p}{3}. \]

It is easy to see that equality can only occur if $x(v_0) = x(v_1) = g/p = 1/3$ 
and their common gray neighborhood is a vertex of weight $1 - 2g/p = 
1/3$. □ 
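The equality case at the end of this proof can be checked numerically. The sketch below builds the matrix of $K(3,0)$ under the assumed convention that a white vertex contributes $p$ on the diagonal and a gray edge contributes 0, evaluates the quadratic form at the weights $(1/3, 1/3, 1/3)$, and confirms that the right-hand side of (9) with $x(v_0) = g/p$ returns the same value. The helper names are hypothetical.

```python
from fractions import Fraction

def quad_form(M, x):
    """Evaluate x^T M x for a square matrix M given as a list of rows."""
    n = len(x)
    return sum(x[i] * M[i][j] * x[j] for i in range(n) for j in range(n))

def k30_matrix(p):
    """K(3,0): three white vertices (diagonal p), all edges gray (entry 0)."""
    return [[p if i == j else Fraction(0) for j in range(3)] for i in range(3)]

def rhs_of_9(p, x0):
    """Right-hand side of inequality (9): (1-p)/2 + (5p-3)/2 * x(v_0)."""
    return (1 - p) / 2 + (5 * p - 3) / 2 * x0

p = Fraction(11, 20)            # a sample value in [1/2, 3/5)
x = [Fraction(1, 3)] * 3        # the weights named in the equality case
g = quad_form(k30_matrix(p), x)

assert g == p / 3               # K(3,0) attains p/3
assert rhs_of_9(p, g / p) == g  # (9) is tight when x(v_0) = g/p
```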



Given Fact 15 and Fact 16, we can identify $v_0$, a vertex of maximum 
weight, $v_1$, a vertex of maximum weight among those in $N_G(v_0)$, and $\{v_2, w_2\} = 
N_G(v_0) \cap N_G(v_1)$. Without loss of generality, let $x(v_2) \ge x(w_2)$. For ease of 
notation, let $x_i = x(v_i)$ for $i = 0, 1, 2$. If $N_G(v_0) \cap N_G(v_2) - \{v_1\}$ is nonempty, 
then let its unique vertex be denoted $w_1$. (Uniqueness is a consequence of 
Fact 15.) 



Case 1. The vertex $w_1$ does not exist. 

Most of our observations come from inclusion-exclusion: $|A| + |B| = |A \cup 
B| + |A \cap B|$. Inequality (10) comes from the fact that $N_G(v_0) \cap N_G(v_1) = 
\{v_2, w_2\}$. Inequality (11) comes from the fact that $N_G(v_0) \cap N_G(v_2) = \{v_1\}$. 
Hence, 

\[ d_G(v_0) + d_G(v_1) \le 1 + 2x_2 \tag{10} \]

\[ d_G(v_0) + d_G(v_2) \le 1 + x_1. \tag{11} \]



Solve for $x_2$ in each case, recalling that Lemma 7(ii) gives that 
$d_G(v_2) = \frac{1-p-g}{1-p} + \frac{2p-1}{1-p}\,x_2$. Inequality (10) gives a lower bound for $x_2$ and inequality 
(11) gives an upper bound: 

\[ \frac{1}{2}\bigl(d_G(v_0) + d_G(v_1) - 1\bigr) \le x_2 \le \frac{1-p}{2p-1}\left(1 + x_1 - d_G(v_0) - \frac{1-p-g}{1-p}\right). \]

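Some simplification gives 

\[ \begin{aligned}
2g &\ge d_G(v_0) + (2p-1)\,d_G(v_1) - 2(1-p)\,x_1 - 2p + 1 \\
   &= 2p\,\frac{1-p-g}{1-p} + \frac{2p-1}{1-p}\,x_0 + \frac{2p^2-1}{1-p}\,x_1 - 2p + 1, \\
g  &\ge \frac{1-p}{2} + \frac{2p-1}{2}\,x_0 + \frac{2p^2-1}{2}\,x_1.
\end{aligned} \]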



If $2p^2 - 1 \ge 0$ (i.e., $p \ge 1/\sqrt{2}$), then $g \ge (1-p)/2$. Otherwise, we use the 
bound $x_1 \le x_0$. 

\[ g \ge \frac{1-p}{2} + \frac{2p-1}{2}\,x_0 + \frac{2p^2-1}{2}\,x_0 = \frac{1-p}{2} + (p^2 + p - 1)\,x_0. \]



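If $p^2 + p - 1 \ge 0$ (i.e., $p \ge (\sqrt{5} - 1)/2$), then $g \ge (1-p)/2$. Otherwise, we 
use the bound from Corollary 8(ii) that $x_0 \le g/p$. 

\[ g \ge \frac{1-p}{2} + (p^2 + p - 1)\,x_0 \ge \frac{1-p}{2} + (p^2 + p - 1)\,\frac{g}{p}, \qquad\text{which rearranges to}\qquad g \ge \frac{p}{2(1+p)}. \]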



Equality occurs only if $x_0 = x_1 = g/p$ and $x(w_2) = x_2 = 1/2 - g/p$. This is 
precisely the CRG denoted K^'^\ 
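The extremal configuration of Case 1 can be verified directly. In the sketch below the four vertices are ordered $v_0, v_1, v_2, w_2$; the only black edge is $v_2 w_2$ (it cannot be gray because $N_G(v_0) \cap N_G(v_2) = \{v_1\}$) and every other pair is gray. The matrix convention ($p$ on the diagonal for white vertices, $1-p$ for black edges, 0 for gray edges) is assumed from the background references, and the function names are illustrative.

```python
from fractions import Fraction

# Vertex order: v0, v1, v2, w2.
def case1_matrix(p):
    """Matrix of the Case 1 CRG under the assumed convention."""
    n = 4
    M = [[Fraction(0)] * n for _ in range(n)]
    for i in range(n):
        M[i][i] = p                 # white vertices
    M[2][3] = M[3][2] = 1 - p       # the single black edge v2-w2
    return M

def case1_weights(p):
    """Weights from the equality case: x0 = x1 = g/p, x2 = x(w2) = 1/2 - g/p."""
    g = p / (2 * (1 + p))
    a = g / p
    b = Fraction(1, 2) - a
    return g, [a, a, b, b]

def row_values(M, x):
    """Entries of the vector M x; at an optimal weighting they all equal g."""
    return [sum(M[i][j] * x[j] for j in range(len(x))) for i in range(len(x))]

p = Fraction(3, 5)
g, x = case1_weights(p)
assert sum(x) == 1
assert all(r == g for r in row_values(case1_matrix(p), x))
```

Every row of $Mx$ equals $g = p/(2(1+p))$, so the weighting satisfies the stationarity condition that the derivation relies on.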

Case 2. The vertex $w_1$ exists. 

Inequality (12) comes from the fact that $N_G(v_0) \cap N_G(v_1) = \{v_2, w_2\}$ and 
$x(w_2) \le x(v_2) = x_2$. Inequality (13) comes from the fact that $N_G(v_0) \cap 
N_G(v_2) = \{v_1, w_1\}$ and $x(w_1) \le x(v_1) = x_1$. Hence, 

\[ d_G(v_0) + d_G(v_1) \le 1 + 2x_2 \tag{12} \]

\[ d_G(v_0) + d_G(v_2) \le 1 + 2x_1. \tag{13} \]



Adding (12) and (13) gives 

\[ 2d_G(v_0) + d_G(v_1) + d_G(v_2) \le 2 + 2(x_1 + x_2) \]

\[ 2d_G(v_0) - \frac{2g}{1-p} \le \frac{3-4p}{1-p}\,(x_1 + x_2). \tag{14} \]



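If $p \ge 3/4$, then (14) gives that $2d_G(v_0) - \frac{2g}{1-p} \le 0$. Consequently, since 
Lemma 7(ii) gives $d_G(v_0) \ge \frac{1-p-g}{1-p}$, we have $2\,\frac{1-p-g}{1-p} - \frac{2g}{1-p} \le 0$ 
and so $g \ge (1-p)/2$ easily. Thus, we assume $p < 3/4$. 
Next, we use Fact 17 to conclude that $v_0$ is the only common gray neighbor 
of $v_1$ and $v_2$. 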

Fact 17. Let $p \ge 1/2$ and $K$ be a $p$-core with white vertices and black or 
gray edges. Let $a_0, a_1, a_2, b_0, b_1, b_2 \in V(K)$ such that $\{a_0, a_1, a_2\}$ is a gray 
triangle and $\{b_i, a_j\}$ is a gray edge as long as $i$ and $j$ are distinct. Then, 
$H_9 \mapsto K$. 



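Proof. The following map shows the embedding: 

\[ \begin{aligned}
2, 7 &\mapsto a_0 & \qquad 0 &\mapsto b_0 \\
1, 5 &\mapsto a_1 & \qquad 3 &\mapsto b_1 \\
4, 8 &\mapsto a_2 & \qquad 6 &\mapsto b_2.
\end{aligned} \]

□ 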



If $v_1$ and $v_2$ have a common gray neighbor in $K$ other than $v_0$, call it $w_0$ and 
observe that by setting $a_i := v_i$ and $b_i := w_i$ for $i = 0, 1, 2$, Fact 17 would 
imply that $H_9 \mapsto K$. 

Since $v_0$ is the only common gray neighbor of $v_1$ and $v_2$, 

\[ d_G(v_1) + d_G(v_2) \le 1 + x_0 \tag{15} \]

\[ \frac{2p-1}{1-p}\,(x_1 + x_2) \le 1 + x_0 - 2\,\frac{1-p-g}{1-p}. \]



Inequality (14) gives a lower bound for $x_1 + x_2$ and inequality (15) gives an 
upper bound. Recall that Lemma 7(ii) gives that $d_G(v) = \frac{1-p-g}{1-p} + \frac{2p-1}{1-p}\,x(v)$ 
for any vertex $v \in V(K)$. Recall that we assume $p < 3/4$. 

\[ \frac{1-p}{3-4p}\left(2d_G(v_0) - \frac{2g}{1-p}\right) \le x_1 + x_2 \le \frac{1-p}{2p-1}\left(1 + x_0 - 2\,\frac{1-p-g}{1-p}\right). \]
Some simplification gives 

\[ 2(2p-1)\bigl((1-p)\,d_G(v_0) - g\bigr) \le (3-4p)\bigl((1-p)(1 + x_0) - 2(1-p-g)\bigr) \]

and so 

\[ g \ge \frac{1-p}{2} + \frac{4p^2 - p - 1}{2}\,x_0. \]

If $4p^2 - p - 1 \ge 0$ (i.e., $p \ge (\sqrt{17} + 1)/8$), then $g \ge (1-p)/2$. Otherwise, 
we use the bound $x_0 \le g/p$ from Corollary 8(ii). 

\[ g \ge \frac{1-p}{2} + \frac{4p^2 - p - 1}{2}\left(\frac{g}{p}\right), \qquad\text{which rearranges to}\qquad g \ge \frac{p}{1+4p}. \]

Equality occurs only if $x_0 = g/p$, $x_1 = x_2 = g$ and $x(w_i) = x_i$ for $i = 1, 2$. 

This is precisely the CRG denoted K^^\ 
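A numerical check of the Case 2 equality configuration is sketched below, with vertices ordered $v_0, v_1, v_2, w_1, w_2$. The black edges $v_1 w_1$ and $v_2 w_2$ follow from the common-neighborhood conditions in the text. Taking $w_1 w_2$ to be gray is an assumption made here so that the weighting is stationary; the text above does not state the color of that edge. The matrix convention and all function names are likewise assumptions for illustration. The last loop also confirms the gray-degree formula of Lemma 7(ii) on this example.

```python
from fractions import Fraction

# Vertex order: v0, v1, v2, w1, w2.
# Black edges v1-w1 and v2-w2; every other pair is taken to be gray
# (the gray edge w1-w2 is an assumption, see the lead-in).
BLACK_EDGES = {(1, 3), (2, 4)}

def is_black(u, v):
    return (min(u, v), max(u, v)) in BLACK_EDGES

def case2_matrix(p):
    """Assumed convention: p on the diagonal, 1 - p on black edges, 0 on gray edges."""
    return [[p if u == v else (1 - p if is_black(u, v) else Fraction(0))
             for v in range(5)] for u in range(5)]

def case2_weights(p):
    """Weights from the equality case: x0 = g/p and all other weights equal to g."""
    g = p / (1 + 4 * p)
    return g, [g / p, g, g, g, g]

def row_values(M, x):
    """Entries of the vector M x."""
    return [sum(M[u][v] * x[v] for v in range(len(x))) for u in range(len(x))]

def gray_degree(x, u):
    """Total weight of the gray neighbors of vertex u."""
    return sum(x[v] for v in range(len(x)) if v != u and not is_black(u, v))

p = Fraction(3, 5)
g, x = case2_weights(p)
assert sum(x) == 1
assert all(r == g for r in row_values(case2_matrix(p), x))
for u in range(5):
    # Lemma 7(ii): d_G(v) = (1-p-g)/(1-p) + (2p-1)/(1-p) * x(v)
    assert gray_degree(x, u) == (1 - p - g) / (1 - p) + (2 * p - 1) / (1 - p) * x[u]
```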

Therefore, for $p \in [1/2, 1]$ and in each case, $g \ge \min\{p/(1+4p), (1-p)/2\}$. 
Combining this with the fact that $g \ge p/3$ for $p \in [0, 1/2]$ 
concludes the proof of the lower bound. Consequently, $\mathrm{ed}_{\mathrm{Forb}(H_9)}(p) = 
\min\{p/3, p/(1+4p), (1-p)/2\}$. This concludes the proof of the theorem. 
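To see how the three branches of this minimum fit together, the following sketch evaluates the function in floating point and locates the crossover points. The crossover values are computed here from the three expressions (they are where $p/3 = p/(1+4p)$ and where $4p^2 - p - 1 = 0$) rather than quoted from a statement in the text, and the function names are hypothetical.

```python
import math

def ed_forb_h9(p):
    """Evaluate min{p/3, p/(1+4p), (1-p)/2} for p in [0, 1]."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie in [0, 1]")
    return min(p / 3, p / (1 + 4 * p), (1 - p) / 2)

# Crossover points of the three branches.
P_LEFT = 0.5                        # p/3 = p/(1+4p) exactly when 1 + 4p = 3
P_RIGHT = (1 + math.sqrt(17)) / 8   # p/(1+4p) = (1-p)/2, i.e. 4p^2 - p - 1 = 0

def active_branch(p):
    """Name the expression that attains the minimum at p (ties go to the left branch)."""
    if p <= P_LEFT:
        return "p/3"
    if p <= P_RIGHT:
        return "p/(1+4p)"
    return "(1-p)/2"

for p in (0.25, 0.6, 0.9):
    print(p, active_branch(p), ed_forb_h9(p))
```

The right crossover coincides with the threshold $(\sqrt{17}+1)/8$ that appears in Case 2.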